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METHOD AND SYSTEM FOR THE ERROR RESILIENT 
TRANSMISSION OF PREDICTIVELY ENCODED SIGNALS 

Field of the Invention 
[0001] The present invention relates to error resilient transmission of 

predictively encoded sequences generally, and more particularly to those techniques 
of transmission that use multiple mutually correlated versions of the signal to 
increase error resilience. 

Background of the Invention 
[0002] Currently, a range of communication channels (including the Internet 

and wireless channels) can be modeled as erasure channels. It is well known that the 
transmission of multiple mutually correlated but distinct versions of a given signal 
enhance the probability of the high fidelity reconstruction of the given signal. The 
underlying concept of such techniques is that the erasure channels have a more or 
less independent probability of failure, thus ensuring that the probability of the 
correct reception of at least a few channels is high even when the correct reception of 
a single channel is low. 

[0003] The transmission of several exact replicas of a given signal 

over a channel may result in an explosion of bandwidth required to transmit the 

signal over the channel. As illustrated in Figure 1, a solution to this problem 

involves transmitting several correlated but not exact replicas of the signal over the 

channel. When such a technique is employed the reconstruction fidelity of the signal 

improves with the number of copies of the signal that are correctly received. There 

are numerous publications relating to the above-mentioned signal transmission 

problem. Related inventions disclosed in literature include United States Patent 

Number 6330370 issued to Goyal, et al. for the "Multiple description transform 

coding of images by the use of optimal transforms of arbitrary dimension" and 
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United States Patent Number 6542554 issued to Jafarkhani, et al. for "Multiple 
description trellis coded quantization". 

[0004] Although such techniques have been shown to achieve good 

performance in practice, their extension to the case of predictively encoded signals, 
i.e., signal coding with memory, is not well understood. If the source signal 
sequence consists of predictively encoded symbols, as in the case of video coding, 
the additional problem of predictive mismatch is encountered. 
[0005] Predictive mismatch refers to the scenario in which the reconstruction 

of the predictor symbol at the decoder is different from the predictor symbol used at 
the encoder. This causes the reconstruction of the predicted symbol to be erroneous 
and this effect propagates through the sequence. In the context of transmission of 
multiple correlated signals over independent channels, since the number of channel 
failures in transmitting the predictor symbol is unknown at the encoder, the decoder 
reconstruction of the predictor cannot be accurately reproduced at the encoder 
leading to a mismatch. Thus, the key issue to be solved in the case of predictively 
encoded symbols is circumventing the problem of predictive mismatch. 
[0006] Layered coding is a coding technique that uses multiple channels for 

the transmission of predictively coded information in the presence of channel 
erasures. However, layered coding requires that one particular channel, which is 
typically referred to as the 'base channel', has to be received with fidelity in order 
for a scheme to succeed. The present invention does not rely on such restrictions and 
therefore possesses greater general applicability. The resultant problem when two 
correlated representations are employed is illustrated in Figure 2. 
[0007] The residual-of-residuals (RoR) technique addresses this problem by 

sending a correction signal that partially removes predictive mismatch. However, the 
increase in rate incurred in transmitting the correction signal is quite high. United 
States patent Number 6556624 to Orchard, et al. for a "Method and apparatus for 
accomplishing multiple description coding for video" discloses an invention that 
utilizes a similar technique for video transmission. It must be noted that this 
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technique, in direct contrast to the present invention, requires the transmission of at 
least one additional correction signal to accomplish its objectives. 
[0008] The invention disclosed in United States Patent Number 6611530 to 

Apostolopoulos for "Video communication using multiple streams" discloses a 
method for transmitting predictively encoded video using multiple streams, such that 
errors in one or more streams does not destroy the entire video stream but only 
reduces the play back frame rate. Conversly, the utilization of the present invention 
does not lead to a reduction in the frame rate in the event of the loss of one or more 
channels. 

[0009] The present invention eliminates predictive mismatch while avoiding 

transmission of a correction signal. 

Summary of the Invention 
[0010] The disclosed invention provides method, system and computer 

program products for introducing controlled correlation among multiple redundant 
representations of a predictively encoded signal in order to reduce predictive 
mismatch at the receiver when any given sub-set of the multiple representations is 
received. 

[0011] An embodiment of the present invention comprises an encoding 

system for the transmission of predictively encoded signals over a plurality of 
channels. The system comprises a signal source, wherein the signal source transmits 
a signal over a channel comprising at least two transmission channels. Further, at 
least two signal adding means are implemented wherein each adding means receives 
the signal transmitted from the signal source and forms a coefficient by subtracting a 
differing predictive value from the signal and outputting a respective coefficient 
value. At least two signal quantizing means are used for receiving differing 
coefficient values from the signal adding means, wherein the quantizing means 
perform the operations of quantizing the received coefficient values and outputting 
the resultant values. Lastly, at least two encoding means are implemented for 
receiving the coefficients from the signal quantizing means, wherein the encoder 
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means performs the operation of transforming the received coefficients and 
transmitting at least two resulting scalar coefficient values, where the number of the 
transmitted scalar coefficients is less than the number of coefficients values 
generated by the transformation, to a decoding system embodied within the present 
invention. 

[0012] Further embodied is a decoding system for the decoding of 

predictively encoded signals that are received over a plurality of channels, wherein 
more than one subset of the channels can be used to reconstruct an approximate 
version of the originally encoded signal. The decoding system comprises a central 
decoding means, wherein the central decoder means receives a first set of coefficient 
values from at least two channels. The central decoding means comprises at least 
two signal adding means for respectively receiving the scalar coefficient values and 
then adding at least one value which is a transform of the difference between two 
predictive values to the coefficient value in order to generate a second set of 
coefficient values. Further, the central decoder comprises a decoding means for 
receiving the coefficient values transmitted from the signal adding means. The 
decoding means transforms and transmits a second set of coefficient values to an 
additional signal adding means, wherein the signal adding means further generates 
and outputs a third set of coefficient values; the third set of coefficients being used to 
reconstruct an approximate version of an encoded sequence of symbols. 

Brief Description of the Drawings 
[0013] The accompanying drawings illustrate one or more embodiments of 

the invention and, together with the written description, serve to explain the 
principles of the invention. Wherever possible, the same reference numbers are used 
throughout the drawings to refer to the same or like elements of an embodiment, and 
wherein: 

[0014] Figure 1 is pictorial diagram demonstrating the transmission of 

multiple correlated descriptions over erasure channels. 
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[0015] Figure 2 is a pictorial diagram demonstrating the problem of 

predictive mismatch. 

[0016] Figure 3 is a diagram illustrating an exemplary embodiment of the 

current invention. 

[0017] Figure 4 is a diagram illustrating an embodiment of an encoder that 

may be utilized within embodiments of the present invention. 

[0018] Figure 5 is a diagram illustrating an embodiment of a decoder system 

that may be utilized within embodiments of the present invention. 

[0019] Figure 5A is a diagram illustrating a central decoder that may be 

utilized within embodiments of the present invention. 

[0020] Figure 5B is a diagram illustrating a side decoder that may be utilized 

within embodiments of the present invention. 

[0021] Figure 6 is a flow diagram illustrating a method for encoding a 

predictively encoded signal. 

[0022] Figure 7 is a flow diagram is a flow diagram illustrating a method for 

decoding a predictively encoded signal. 



Detailed Description 
[0023] One or more exemplary embodiments of the invention are described 

below in detail. The disclosed embodiments are intended to be illustrative only since 
numerous modifications and variations therein will be apparent to those of ordinary 
skill in the art. In reference to the drawings, like numbers will indicate like parts 
continuously throughout the views. 

[0024] Disclosed are embodiments of the present invention in which a 

baseline scheme of multiple description correlating transforms (MDCT) are modified 
to address the problem of predictive mismatch as applicable to transmission of 
predictively encoded sequences. However, it should be understood that the 
technique can be modified to apply to many other schemes for transmitting multiple 
correlated descriptions over erasure channels. The disclosed embodiments of the 
present invention use two representations of a signal, however, the present invention 
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can be modified to be made applicable to cases when more than two redundant 
representations are utilized. 

[0025] The present invention is initially described in reference to Figure 3. 

Figure 3 illustrates a system 300 for the introduction of controlled correlation among 
multiple redundant representations of a predictively encoded signal. The present 
invention avoids predictive mismatch at a receiver when any given sub-set of the 
multiple representations is received. The illustrated embodiment of the present 
invention comprises an encoding device 305 and a decoding device 310. The 
encoding device 305 and the decoding device 310 are in communication via 
transmission channels 315 and 320. 

[0026] It is assumed in the following descriptions that a signal comprising a 

two-coefficient vector X is being transmitted over two channels using the prediction 
information from previously encoded signals. In this context, the following notation 
is introduced wherein Po is the prediction value of X if information from both 
channels is received, Pi is the prediction value if only the signal from a first channel 
is received and P2 is the prediction value if only the signal from a second channel is 
received. [A]& is representative of the vector A after it is quantized with a step size A. 

'a b 



Further, Tis representative of a 2x2 matrix 



, which is used to introduce a 



c d 

controlled amount of correlation between the coefficients that are to be transmitted 
given the source signal statistics. In the specified context if A is a 2x1 vector, [aJ is 
then used to denote the first coefficient of the vector TA. 
[0027] In general T may not be orthogonal, thus it may be better to 

implement the transform and the quantization steps in tandem by the use lifting 
techniques to aid in the reduction quantization error. In this context, f refers to a 
discrete version of T thereby combing the transform and the quantization steps. 
[0028] Further, 9? represents the general linear minimum mean squared error 

(LMMSE) predictor of the coefficients of X. Thus, if [A]\ is the first coefficient of 
vector .4, then [Af represents the LMMSE estimate of both coefficients of X given 



YOR920030575US1 



6 



[A]\. The resultant X is communicated by the transmittal of the scalar coefficients 
Si= {X - P 0 } T A l + {P 0 - P } } T A l , 5j g 9? on a first channel and S 2 = 



400 for transmitting predictively encoded signals over a plurality of channels. The 
system comprises a signal source 402, wherein the signal source 402 transmits a 
signal So over a channel comprising at least two transmission channels. The signal 
So is received at at least two signal adding means 412, each signal adding means 412 
receives the signal So and forms a coefficient by subtracting a differing predictive 
value 408, 410 from the signal So and outputs the respective coefficient values to a 
signal quantizing means 414. 

[0030] The quantizing means 414 performs the operation of quantizing the 

received coefficient values and outputting a resultant value to an encoding means 
416, wherein the encoding means 416 performs the operation of transforming the 
received coefficient value and transmitting the respective resulting scalar coefficient 

values Si= {X - P 0 £ , + {P 0 - P } f A } , 5, € on a first channel and S 2 = 

{X-P 0 } T A2 +{P 0 -P 2 } T ^ 2 ^2 € ^ on a second channel. 

[0031] When both scalar coefficients are received at a decoder 500 (as 

illustrated in Figure 5), reconstruction of the encoded signal is accomplished as 
follows. 



{X - P 0 Y A 2 + {P 0 - P 2 Y Aa , S 2 <= on a second channel. 

[0029] Figure 4 illustrates an exemplary embodiment of an encoding system 



E = T 



$2 ~{^0 - ^}a,2 



(Equation 1) 



(Equation 2) 



(Equation 3) 



YOR920030575US1 



7 



X 0 =P 0 +E = [X^ 



(Equation 4) 



[0032] The computation of E makes use of the invertibility of the discrete- 

domain transform f and the equality of equation 4 follows since P Q e AZ 2 . The key 

concept being P 0? P\ and P 2 are all present at the decoder 500 if both scalar 
coefficient values Sj and S2 are received. For this reason, the decoder 500 can 
compute E completely from S\, S2 and information that is already available at the 
decoder 500. Summarily, if both S\ and S2 are received at the decoder 500, it is 
possible to reconstruct the signal X such that the only distortion in the transmitted 
signal is the quantization distortion. 

[0033] However, in the instance when only one of the two channels is 

received the reconstruction hinges on the fact that the quasi-linear discrete-domain 
transform f approximates a linear continuous-domain transform Tfov small 
quantization step size. For this reason, T and T are related as follows: 



f(x) = [7; [T 2 [r 3 x]A]A]A (Equation 5) 

T = T X T 2 T 3 

[0034] Then the following holds 

|r(jc)-r(x| <(l + maxj4H+|l + a&|})y xeAZ 2 (Equation 6) 

[0035] Thus 

S x ={x-P Q Z, + {P 0 -P£ A (Equation?) 

= {x - pX 
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[0036] Where the first step follows from equation 6 and the second step 

follows from the linearity of the transform T. Now, assuming only a signal Si is 
received, the one-channel reconstruction can be achieved by the equation 

X = {X-P$l x +P x (Equation 8) 



[0037] The one-channel reconstruction obtained is distorted since only one 

description of the residual {X- P\) is used in the reconstruction of the signal X, 
However, predictive mismatch is avoided since the predictor used at the encoder (Pi 
in the above case) is the same as the predictor present at the decoder 500. 
[0038] As referenced above, further embodiment of the present invention 

comprises a decoding system 500 for decoding a predictively encoded signal that is 
transmitted over a plurality of channels, wherein more than one subset of the 
channels can be used to reconstruct an approximate version of the encoded signal. 
As illustrated in Figures 5 and 5 A, the decoding system 500 comprises a central 
decoding means 506, wherein the central decoder means 506 receives a first set of 
scalar coefficient values Si and S 2 from at least two channels. Further, the central 
decoding means 506 comprises at least two signal adding means 510, 512 (Figure 
5 A) for respectively receiving the scalar coefficient values and adding at least one 
predictive value transform 508, 514 (which are transforms of the difference between 
two predictive values) to the coefficient value in order to generate and transmit a 
second set of coefficient values. A decoding means 516 is used to receive the 
coefficient values transmitted from the adding means 510, 512 wherein the decoding 
means 516 transforms and transmits the second set of coefficient values to a signal 
adding means 520 where the signal adding means 520 in conjunction with a 
predictive value transform 518 further generate a third set of coefficient values 
which are reconstructed as the signal Xo. 

[0039] With reference to Figures 5 and 5B, further aspects of this 

embodiment comprise two side decoders 502 and 504, wherein the side decoders 
receive the first set of coefficient values Si or S 2 transmitted from the encoder 305. 
The side decoders 502 and 504 comprise a decoding means 522 for transforming the 
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coefficients received from the encoder 305. Subsequently, the decoding means 522 
transmits the transformed coefficients to a signal adding means 526, wherein a 
second set of coefficients is generated by adding a predictive value 524 of the first 
coefficient values to the first coefficient and the signal Xi or X2 are reconstructed 
depending upon the scalar coefficient Si or S2 transmitted from the encoder 305 that 
are processed by the side decoders 502, 504. 

[0040] Figure 6, illustrates a method for the encoding and transmittal of 

predictively encoded signals over a plurality of channels. The method comprises the 
steps of receiving a source signal over a channel at step 602, wherein the channel 
comprises at least two channels, and at step 604 forming a coefficient by subtracting 
a differing predictive value from the signal and outputting the respective coefficient 
values. At step 606 the coefficient values output at step 604 are received at at least 
two quantizing means, wherein the quantizing means perform the operations of 
quantizing the received coefficient values and outputting the resultant values. At 
step 608, the coefficients are transmitted from the quantizing means to at least two 
encoders, wherein the encoders perform the operation of transforming the received 
coefficients and at step 610 transmitting the resulting scalar coefficient values Si 

= {X-PoY AA +{P 0 -PiY AA ,Si e9? on a first channel and S 2 = 

{X - P Q } T A 2 + {P 0 - P 2 } T A 2 , 5 2 e 9? on a second channel. 

[0041] Figure 7 illustrates another embodiment of the preset invention that 

comprises a method for decoding a predictively encoded signal transmitted over a 
plurality of channels, such that more than one subset of the channels can be used to 
reconstruct an approximate version of the sequence. We find that at step 702 
coefficients from more than one channel are received from an encoding source. At 
step 704, a second set of coefficients is generated by taking a function of the 
difference of predicted values generated from information from two different subsets 
of the total number of channels. A third set of coefficients is next generated at step 
706 by either adding a function of the received coefficients from step 702 to 
corresponding coefficients from step 704 or directly copying a function of the 
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coefficients from step 702 without any addition, provided that at least one coefficient 
in this third set is generated by the process of addition. Finally, at step 708 an 
approximate version of the transmitted signal is reconstructed by further decoding 
the third set of coefficients from step 706. 

[0042] The present invention additionally embodies a computer program 

product that includes a computer readable medium useable by a processor, the 
medium having stored thereon a sequence of instructions which, when executed by 
the processor causes the processor to decode a predictively encoded signal that has 
been transmitted over a plurality of channels, such that more than one subset of the 
channels can be used to reconstruct an approximate version of the sequence. 
[0043] The computer program product executes the steps of receiving 

coefficients from at least two channels and subsequently forming a second set of 
coefficients in which each coefficient is generated by taking a function of the 
difference of predicted values generated using information from two different 
subsets of the total number of channels. The computer program product next 
generates a third set of coefficients by either adding a function of the received 
coefficients to corresponding coefficients from the second set of coefficients or 
directly copying a function of the coefficients from the received coefficients without 
any addition, provided that at least one coefficient in this third set is generated by the 
process of addition. Lastly, the computer program product reconstructs an 
approximate version of the transmitted signal by further decoding the third set of 
coefficients. 

[0044] A yet further embodiment of the present invention comprises a 

computer program product that includes a computer readable medium useable by a 
processor, the medium having stored thereon a sequence of instructions which, when 
executed by the processor, causes the processor to encode and transmit predictively 
encoded signals over a plurality of channels. The computer program product 
executes the steps of receiving a source signal over a channel and forming a 
coefficient by subtracting a differing predictive value from the signal and outputting 
the respective coefficient values. Next the computer program product performs the 
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operation of quantizing the received coefficient values and outputting the resultant 
scalar coefficient value S 1 = {X - P Q }^ , + {P 0 - P x } T A , , S x e 5R on a first channel and 

S 2 = {X - P 0 }[ 2 + {P 0 - P 2 } T A 2 , S 2 e 9? second channel. 

[0045] It will be apparent to those skilled in the art that various modifications 

and variations can be made in the present invention without departing from the scope 
or spirit of the invention. Other embodiments of the invention will be apparent to 
those skilled in the art from consideration of the specification and practice of the 
invention disclosed herein. It is intended that the specification and examples be 
considered as exemplary only, with a true scope and spirit of the invention being 
indicated by the following claims. 
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